-
Notifications
You must be signed in to change notification settings - Fork 21
[CI/Build][Intel] Enable benchmarks on Intel Gaudi 3 runner #94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Hi @jakub-sochacki! Thank you for your pull request and welcome to our community. Action RequiredIn order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you. ProcessIn order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA. Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with If you have received this in error or have any questions, please contact us at cla@meta.com. Thanks! |
|
Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good to me. thanks
| pip install -r .github/scripts/requirements.txt \ | ||
| --extra-index-url https://download.pytorch.org/whl/rocm6.3 | ||
| elif [[ "${DEVICE_NAME}" == "hpu" ]]; then | ||
| grep -v "^torch==" .github/scripts/requirements.txt > /tmp/requirements_no_torch.txt |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: Is there a way to write something as follows in the requirements file?
torch; platform_system == 'Linux' and platform_machine != 'hpu'
Probably no, I haven't seen this syntax before, so just want to check
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think we need to install pytorch here now, but let me follow up on this in a separate PR to just remove torch from requirements.txt
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Alright @huydhn, so please let me know if you removed torch from requirements there in other PR. Then i will remove the line which filters out torch for gaudi.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@huydhn , this change is to make sure we don't re-install torch. Since HPU is using special torch version. I hope it is OK to leave the grep -v "^torch==" .github/scripts/requirements.txt > /tmp/requirements_no_torch.txt just in case.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Plz rebase and I can help land this
linux.hpu.gaudi3.8runner to benchmark matrix (placeholder)test-[throughput | latency | serving]-hpu.jsonfiles with benchmark configurationsgaudi3to default runners list in workflow dispatchhl-smicommandvllm/last-good-commit-for-vllm-gaudibranch to get compatibility history (N most recent VLLM_STABLE_COMMIT updates)The commit selection mechanism solves the race condition where VLLM_STABLE_COMMIT might change between CI image builds and benchmark runs (every 12 hours), ensuring benchmarks always find an existing compatible Docker image.